Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+
arxiv.org·1d
🧠Large Language Models (LLMs)
Make Sense of Video Analytics by Integrating NVIDIA AI Blueprints
developer.nvidia.com·1d
📊AI Performance Profiling
VISTA Score: Verification In Sequential Turn-based Assessment
arxiv.org·1d
🧠Large Language Models (LLMs)
DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models
arxiv.org·1d
🧠Large Language Models (LLMs)
AstuteRAG-FQA: Task-Aware Retrieval-Augmented Generation Framework for Proprietary Data Challenges in Financial Question Answering
arxiv.org·1d
🧠Large Language Models (LLMs)
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·1d
🧠Large Language Models (LLMs)
LLM-Centric RAG with Multi-Granular Indexing and Confidence Constraints
arxiv.org·1d
🧠Large Language Models (LLMs)
Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry
arxiv.org·1d
🧠Large Language Models (LLMs)
Thought Branches: Interpreting LLM Reasoning Requires Resampling
arxiv.org·1d
🧠Large Language Models (LLMs)
Patient-Centered Summarization Framework for AI Clinical Summarization: A Mixed-Methods Design
arxiv.org·1d
🧠Large Language Models (LLMs)
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.org·1d
🔧Systems-level optimizations for LLM serving
Identifying the Periodicity of Information in Natural Language
arxiv.org·1d
🧠Large Language Models (LLMs)
FOCUS: Efficient Keyframe Selection for Long Video Understanding
arxiv.org·1d
🧠Large Language Models (LLMs)
Independent Clinical Evaluation of General-Purpose LLM Responses to Signals of Suicide Risk
arxiv.org·1d
🧠Large Language Models (LLMs)
Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading
arxiv.org·5d
🧠Large Language Models (LLMs)
A Quantitative Framework to Predict Wait-Time Impacts Due to AI-Triage Devices in a Multi-AI, Multi-Disease Workflow
arxiv.org·1d
📊AI Performance Profiling
PORTool: Tool-Use LLM Training with Rewarded Tree
arxiv.org·4d
🧠Large Language Models (LLMs)
Can MLLMs Read the Room? A Multimodal Benchmark for Verifying Truthfulness in Multi-Party Social Interactions
arxiv.org·1d
🧠Large Language Models (LLMs)
Culture Cartography: Mapping the Landscape of Cultural Knowledge
arxiv.org·1d
🧠Large Language Models (LLMs)
Auditing LLM Editorial Bias in News Media Exposure
arxiv.org·1d
🧠Large Language Models (LLMs)